Kharagpur
a7c4163b33286261b24c72fd3d1707c9-Supplemental-Datasets_and_Benchmarks.pdf
These datasets enable large-scale study of abuse detection for these languages. Anonymized comments: To further address privacy concerns, we anonymize our dataset. We combine the hate and offensive categories in these datasets for training a binary classification model. We show the percentage (%) of emoticons present in our dataset MACD in Table 12. In future work, we will investigate in detail the impact of emoticons on abuse detection. However, due to the limited scale and diversity of abuse detection datasets in Indic languages, development of these models for Indic languages has been severely impeded.
Industry:
- Law (0.47)
- Information Technology (0.34)
Country:
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
- North America > United States > Hawaii (0.04)
- (6 more...)
Industry:
- Government (1.00)
- Law (0.93)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.68)
- (2 more...)
Country:
- Asia > Singapore (0.05)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > Germany > Bavaria > Regensburg (0.04)
- Asia > India > West Bengal > Kharagpur (0.04)
Industry:
- Information Technology > Security & Privacy (0.46)
- Health & Medicine > Therapeutic Area (0.46)
Country:
- Asia > Singapore (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- (3 more...)
Industry:
- Health & Medicine (1.00)
- Information Technology > Security & Privacy (0.46)
Country:
- North America > United States (0.46)
- Asia > India > West Bengal > Kharagpur (0.05)
- Asia > China (0.04)
- (7 more...)
Industry:
- Information Technology (1.00)
- Government (1.00)
- Energy (1.00)
- (5 more...)
Country:
- Asia > China > Guangdong Province > Shenzhen (0.04)
- North America > United States > New York > Erie County > Amherst (0.04)
- Asia > India > West Bengal > Kharagpur (0.04)
- Asia > China > Heilongjiang Province > Harbin (0.04)
Industry:
- Information Technology > Security & Privacy (1.00)
- Law (0.67)
Country:
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > China > Hong Kong (0.04)
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
- (23 more...)
Industry:
- Health & Medicine > Therapeutic Area > Immunology (0.93)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.92)
- Education (0.67)
Country:
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Switzerland > Zürich > Zürich (0.04)
- North America > United States > Michigan (0.04)
- (10 more...)
Industry:
- Education (0.46)
- Health & Medicine (0.46)
Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
- Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.93)
Country:
- Asia > Middle East > Jordan (0.04)
- Asia > India > West Bengal > Kharagpur (0.04)